Bug localization is a challenging and time-consuming task of the process of bug fixing and, more in general, of software maintenance. Several approaches have been proposed in the literature which support developers in this task by identifying source code files in which the bug is likely to be located. However, the research on this topic never stopped, looking for new methods providing better accuracy and/or better efficiency. In this paper, we propose a two-phase bug localization approach which leverages multi-layer neural networks and distributional features. First phase locations are obtained thanks to a neural network trained on word embeddings representations of fixed bug reports. The second phase refines bug locations taking into account the number of times source code files co-occur in fixed bug locations. To evaluate the approach, we conducted a large-scale experiment on five open source projects, namely Mozilla, Eclipse, Dolphin, httpd, and gcc. Results show that, thanks to pre-trained word embeddings, we were able to implement a scalable approach with a training running time of few hours on large datasets. Performances are comparable to other existing deep learning approaches.

A Two-Phase Bug Localization Approach Based on Multi-layer Perceptrons and Distributional Features / Distante, D.; Faralli, S.. - 11619:(2019), pp. 518-532. (Intervento presentato al convegno 19th International Conference on Computational Science and Its Applications, ICCSA 2019 tenutosi a rus) [10.1007/978-3-030-24289-3_38].

A Two-Phase Bug Localization Approach Based on Multi-layer Perceptrons and Distributional Features

Faralli S.
Co-primo
2019

Abstract

Bug localization is a challenging and time-consuming task of the process of bug fixing and, more in general, of software maintenance. Several approaches have been proposed in the literature which support developers in this task by identifying source code files in which the bug is likely to be located. However, the research on this topic never stopped, looking for new methods providing better accuracy and/or better efficiency. In this paper, we propose a two-phase bug localization approach which leverages multi-layer neural networks and distributional features. First phase locations are obtained thanks to a neural network trained on word embeddings representations of fixed bug reports. The second phase refines bug locations taking into account the number of times source code files co-occur in fixed bug locations. To evaluate the approach, we conducted a large-scale experiment on five open source projects, namely Mozilla, Eclipse, Dolphin, httpd, and gcc. Results show that, thanks to pre-trained word embeddings, we were able to implement a scalable approach with a training running time of few hours on large datasets. Performances are comparable to other existing deep learning approaches.
2019
19th International Conference on Computational Science and Its Applications, ICCSA 2019
Bug localization; Machine learning; Multilayer perceptron networks; Software maintenance; Word embeddings
04 Pubblicazione in atti di convegno::04b Atto di convegno in volume
A Two-Phase Bug Localization Approach Based on Multi-layer Perceptrons and Distributional Features / Distante, D.; Faralli, S.. - 11619:(2019), pp. 518-532. (Intervento presentato al convegno 19th International Conference on Computational Science and Its Applications, ICCSA 2019 tenutosi a rus) [10.1007/978-3-030-24289-3_38].
File allegati a questo prodotto
Non ci sono file associati a questo prodotto.

I documenti in IRIS sono protetti da copyright e tutti i diritti sono riservati, salvo diversa indicazione.

Utilizza questo identificativo per citare o creare un link a questo documento: https://hdl.handle.net/11573/1621517
 Attenzione

Attenzione! I dati visualizzati non sono stati sottoposti a validazione da parte dell'ateneo

Citazioni
  • ???jsp.display-item.citation.pmc??? ND
  • Scopus 1
  • ???jsp.display-item.citation.isi??? 0
social impact